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SUMMARY 



An attempt is made to develop a method for 
the presentation of temporal and spatial socioling- 
uistic data in order to study phenomena of bilingual 
interaction. 

Domain of language usage and other socio- 
cultural variables are abstracted from extant 
sources, normalized and compared with the aim 
of studying sociolinguistic variation, its reflection 
on stability of language distribution and the 
relation of the latter to convergence processes 
in the languages in contact. 

The possibility of applying sociological- 
geographical models for the diffusion of innovation 
to linguistic diffusion is investigated. 

The case of the Macedorumanian nomads in 
the Balkans is used to illustrate the method. 
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0. Introduction 



Recent and less recent literature have 
extensively covered the topic of bilingualism and 
its role in language change, as well as the necessity 
of introducing sociolingniistic variables into the 
analysis of both. On the other hand, detailed 
sociolinguistic studies are mostly oriented towards 
the investigation of sociolinguistic change, and 
especially language maintenance or shift, so that 
not much exists on sociolinguistic geography. What 
I will attempt to do in this paper, is a synthesis of 
the sociolinguistic and linguistic notions of bilingual- 
ism and their interaction in time and space, which 
precipitates language change. Possible models, 
some of them mathematical, used in sociological 
research of diffusion, will also be discussed for 
possible application to the study of language diffusion. 



1. Bilingualism and language change 



By bilingualism is understood here the 
acquisition by a speaker of elements of language 
other than his mother tongue; under language contact 
I include dialect contact, as well as register or style 
or any such contact, determined by the topic or the 
situation. The three fold distinction. as to bilingual 
ability proposed by Diebold (1961) will be tentatively 
kept. Bilingualism will be distinguished into incipient, 
subordinate and coordinate. As the information used 
for my illustrations derives from documents which 
antedate Diebold’s studies by several decades, vague 
impressionistic statements will be converted into 
Diebold’s three fold distinction according to the 
following scheme; 

a) "speaks language X poorly" s incipient bilingualism. 

b) "speaks perfect (language) X in addition to his mother 
tongue T' • coordinate bilingualism. 
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c) as subordinate would be classed the bilinguals 
making up the difference of the total population 
after a and bj also those said to speak "some 
of language X in addition to their mother tongue Y*' . 



I accept with Haugen (1956) and Diebold the 
three stage influence of one language upon another 
through the bilingual individual. ^ 



1. Code switching : "The alternate use of two 

languages". 

2. Interference ; "The overlapping of two 

languages" . 

3. Integration of interference ; "The regular 

use of material from one language m 
another" , which should occur also in 
the speech of monolingual Individuals. 



2. Areal convergence 



The Prague linguists and notably Jakobson 
(1931, 1938, 1944) and Trubetzkoy (1928) studied 
areas in which several languages in close and 
protracted contact (through large bilingual groups), 
within a restricted geographical space, tend to share 
a high number of linguistic traits, not necessarily 
inherited from a parent speech. In many cases the 
languages are not genetically relat'^d at all. The 
traditional examples are from the Balkans, India, 
the Caucasus and also North America. The studies 
by Trubetzkoy and Jakobson’s early work were 
concerned mostly with the linguistic description of 
features typical of certain such language areas (in 
the Prague terminology Sprach blind e ). This was 



also the case with the works of the American anthro- 
pologists and linguists, Boas and Sapir. Comparatively 
recently the so cio linguistic aspects of multilanguage 
convergence were touched upon by Emeneau in two 
articles (1956, 1961), where both the linguistic 
traits, the convergence process and some of the 
social factors that must have brought it about were 
outlined. Before this study of Areal Convergence 
we have, of course, Weinreich’s basic exposition 
in Languages in Contact (1953), as well as numerous 
later contributions (1957, 1958 etc. ) 



3. The sociolinguistic dimensions of language convergence 



The cases studied by Emeneau (1956, 1961) were 
from India and Pakistan. Among recent linguistic studies 
of convergence should be mentioned Sirokov (1964), 

Kazazis (1964, 1967) and Birnbaum (1966, 1968, 1969), 
of which the last two have proposed a transformational 
treatment. Measures of phonological convergence (Jakobsonian 
distinctive feature analyses, phoneme distribution) were 
proposed by Afendras (1968, 1969b, 1969c). An early 
attempt at an exposition of the sociolinguistic aspects of 
Balkan convergence was outlined in Afendras (1969d), 
in which the work of Emeneau and others were briefly 
discussed. 

Emeneau (1961) first exposes the linguistic 
convergence between Brahui and Beluchi in Beluchistan, 
then investigates the social side of Brahui -Beluchi 
interaction, specifically the relative prestige of the 
two languages, political power distribution between 
the two ethnic groups, intermarriage practices (e.g. 
extent of exogamy), child language acquisition, bilaterality 
of bilingualism (that is extent to which bilinguals are drawn 
from both monolingual pools), and distribution of the two • 
languages by social context. 



4. Sociolinguistic matrices, location in time, space 



Since the time of Emeneau’s study, several 
detailed and rigorous works on sociolinguistics and 
bilingualism have appeared(l). The sociolinguistic 
method can be very helpful even for the historical 
analysis of linguistic convergence, as it exposes 
the ongoing process of interference, first phase 
of convergence. These studies have pointed-out 
the importance of understanding the social 
structure and the relationship between the two 
groups involved. Within monolingual groups, 
distribution of dialects or subdialectal varieties 
according to sociolinguistic context, for instance, 
domains of usage --or attitudes of the code users 
towards the languages employed are keys to change 
phenomena. The concept of diglbssia was early 
introduced by Ferguson to describe a situation in 
which two closely related languages were employed 
in a certain strict distribution over domains of 
daily usage and with some concomitant attitudes 
towards the two languages. Fishman (1967) 
extended the term to cover all cases in which two 
language codes are used in non-overlapping domains, 
irrespective of the linguistic distance between these 
codes. Starting from this notion, Fishman also 
defined a notion of stable bilingualism. The following 
social variables have been isolated by the various 
researchers as sociolinguistically pertinent: social 
class, age, sex , occupation , religion, political 
affiliation, education and place of origin . As domains 
of use we have: the home , the locale, the school , the 
work , religious worship, the neighborhood, etc. 



(1) See in bibliography works of Weinreich, Diebold, 
Ferguson, Fishman, Gumpers, Haugen, Rubin, etc. " 



Naturally, some of these are more pertinent in 
some cases than in others. In the particular example 
which I will be using, that of the Macedorumanian 
nomads in the Balkans, age, sex, profession, religion, 
political affiliation, as well as transactions with the 
state authorities are important. Language use for 
each ethnic groups, domain and role — relationships 
have to be specified. In the case where the extant 
sources provide some possibility for statistical 
estimates, such estimates will be given. The 
information will be presented in matrices, which 
start from the most general, with ethnic group and 
language on the axes; these can be gradually "blown 
up" for their sub-sections to include greater detail. 
Ethnic group, particular place (such as city or village 
concerned), as well as chronological time (1912, 1850, 
etc.) and state in power (e. g. Ottoman Empire, Yugoslav 
State) should be indicated. The first blown up matrices 
will include the larger categories into which domains 
are grouped, with the languages used in each of theme 
(See for example the matrices that follow on pages 
2^-pf.). In the third matricial representation of 
greater resolution, will be given one domain with 
its sub-domains or the role relationships within it, 
and on the other axis, the languages used. The 
source from which the information was extracted 
should always be indicated. Incidentally, this 
particular coding will be used for information 
extracted from extant sources of a historical, 
geographic, ethnological, anthropological, political, 
and other nature, which provide the desired information 
accidentally. Possible matrices with information have 
been constructed by screening a limited number of 
sources and are given in the appendix for illustration 
purposes. As an example of roles within a domain, 
we will take home . On the different cells of a row 
will be indicated the language used for a particular 
row,* such as shown along the cells of each column. 

For the domain of home, then, we have the roles 
of father- mother, father-child, mother-child. 



child- child, and the reverse where pertinent, such 
as child- mother, child-father, and mother-father (2). 
The domain of religion will have to yield the particular 
religion, such as in our case, moslem, jewish, 
Christian orthodox or catholic, orthodox patriarchate, 
as opposed to orthodox exarchate, and then within 
each of those a breakdown by role relationship such 
as priest -people (language of the sermon) or language 
of the ritual part of the service (pries t-God ?). 



5. Diachronic and diatopic presentation 



Most of the studies that have been done so far 
along the socio linguistic vein investigate language 
change as reflected in synchronic socio linguistic 
processes. Few of them have really been diachronic 
over large increments of time and none has been 
along the lines of linguistic geography. (We can call 
such studies monotopic and monochronic.) The above 
presentation paralleled by an equal presentation of 
the linguistic facts in matricial or any other form 
(e. g. the geographic convergence study by Afendras, 
1968, 1969b), can be used to find the covariance or 
correlation between the sociolinguistic variables 
and the linguistic changes. We can then take from 
the sociolinguistic matrices, the same cell across 
different points in space, or the same cell across 
different points in time. We can, for instance, 
follow changes in the language of transaction with 
the authorities, or in the language of religion, or 
in the language distribution and usage at home. 

A geographical distribution can be accompanied 
by texts from the different domains (or if not 
available, any texts that are available) and with 
measures of interference or integration of elements 
of another language. Of course, breakdown by domain 



(2) Instances of linguistically asymmetric role- 
relationships are quite common. 
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for the text samples would be highly desirable, as 
we know that interference varies with domain(3). 

A detailed analysis of the type of linguistic inter- 
ference siich as the type of loaning (translation, 
shifting, direct loaning etc.) or interference on 
the other linguistic levels can then be directly 
correlated with the sociolinguistic dimensions. 



6. Some diachronic-diatopic work in Sociolinguistics 



Emeneau (1961) took into consideration 
several of the sociolinguistic factors in his 
diachronic study of Beluchistan and at the same time 
information from the India census which he used for 
statistical calculations of bilingualism incidence. He 
did not attempt,however , a detailed diachronic 
sociolinguistic study. Rubin’s work on Paraguay . 
concerned the sociolinguistic component but did not 
attempt a joint study of the sociolinguistic and 
linguistic aspects. She did though present some 
sample texts and an analysis of the Guarani Spanish 
elements in them, as well as the Swadesh basic list 
in Guarani, with its Spanish loanwords. 

The work of Sheldon Klein (1966, 1969), 
represents a pioneer effort in synthesizing the socio- 
demographic and linguistic aspects of language change 
and bringing it to the "laboratory". His computer 
simulations reproduce conditions in a small, commu- 
nity (communication networks, events such as birth, 
death and marriage) and with a stochastic model 



(3) Mackey (1962:69 ), Ma and Herasimchuk 

(1968:645). 



"predict" language change towards a certain equi 
librium (choice of particular syntactic rules.) 



7. Nature of sociolinguistic and linguistic data 
on the Macedorumanians 



As mentioned above (section 5), several 
works of diverse nature are available^ which give 
accidental information on the sociolinguistics of 
the Macedorumanian nomads in the Balkans. 

As illustrated in the present paper, this iniormation 
can be coded in rather fashionable attire, in spite 
of its extraction from works of respectable age. 

For data on linguistic variation across space, there 
are available studies of the local dialects, although 
in a rather difficult form (4). The Macedorumanians, 
speaking a language akin to Rumanian, (considered 
by some scholars merely a Sub-Danubian dialect 
of Rumanian) were, until very recently, sheep- 
herding nomads living in the central Balkans . From 
their semi-permanenc mountain villages, (on Pindus, 
Gramos etc. ) they migrated seasonally to the plains 
(mostly in what is now Greek, Albanian, and Yugoslav 
Makedonia, where they spent the fall, winter and 
spring months. During these migrations they came 
into contact with the other populations in the plains. 

As soon as the weather permitted, they returned 
to their mountain villages, where they spent some 
of the spring, summer and fall months. Their 
contacts with the other populations took place 
during numerous religious festivals in some of the 

(4) See works of Weigand, Capidan, Caragiu- 
Mario^eanu etc . 



cities neighboring their areas. Their commercial 
transactions with the surrounding populace were 
intensive and centered around the marketing of 
their products, such as cheese, sheep, blankets 
and related items manufactured in their villages. 
Muleteering throughout the Balkans was also in 
their hands since the 6th Century A. D. (Wace 
and Thomson, 1914-256) and thus they were in 
constant contact with the other ethnic groups . 
From early times, quite a few Macedorumanians 
settled in many of the Balkan cities, notably 
Salonica, Prizren, Skopia, even as far south 
as Athens. Numbers of them also immigrated 
to Rumania, and far away central and east 
European countries(5). Their traditional routes 
of nomadic migration, as well as the intensity 
of the contacts and the time spent in the areas 
where they migrated, have been documented 
with various degrees of accuracy and clarity. 

Due to historical changes in the Balkans in this 
century, their ecology and their pattern of life 
have been upset completely and now after rapid 
acculturation to the dominant states and cultures 
between which they have been divided, they are 
disappearing. Where before within the borders 
of the Turkish state they could migrate across 
plains and mountains, fragmentation of the 
Balkans into the sovereign states of Greece, 
Bulgaria, Albania and Yugoslavia has raised 
impassable political frontiers before their 
nomadic moves. Moreover, the trend towards 
ubarnization and industrialization in the Balkan 



(5) In the recent decades a strong migratory current 
to America has developed, with the Massachusetts 
area as the main target. 



states has attracted rjfiaiiy of them to the cities where 
acculturation is always more rapid. 

The most important sociolinguistic components 
for the study of the Macedorumanians are those of 
religion, sex, profession and locale. The role of 
their women in preserving the paternal language 
and customs has been stressed repeatedly in 
scholarly works this and last century. Therefore, 
the study of their intermarriage patterns (endogamy- 
exogamy) is most important for understanding their 
past and future acculturation process, and more 
specifically their language maintenance. Furthermore, 
since their particular professions were what determined 
contacts with the outside communities of Greek, Slavic 
or Albanian speakers, occupation must form an im- 
portant component of the Macedorumanian sociolinguistic 
matrix. As during the Turkish domination, religion was 
the one factor of classification of non-Turkish subjects, 
national or linguistic criteria played a very small role 
until last century. With the Macedorumanians this 
situation continued well into the 20th century and religion 
was what determined their exogamic practices. Before 
language and ethnic awareness, the intimate connection 
between religion and education in the Balkans in much 
of their history added to the forces of acculturation. 

This pattern changed only recently. 



8. In search of a model: social diffusion and geographic 
migration 



The nature of the subject suggests the application 
of a certain type of mathematical model that has been often 
used in sociological research. Namely tlie models for the 
study of diffusion of innovation, both along space and within 



a social structure, and the migration models seem 
quite appropriate. 



The early models utilised in studies of social 
diffusion were based on a simple theory of mechanical 
contagion, similar to the models for the spread of 
epidemics. It is obvious that social diffusion, and 
even more so linguistic diffusion require much more 
complex models. 



Among the most successful attempts at a 
mathematical model of diffusion, lies the work of 
Rapoport. In the early stages, Rapoport used a very 
simple situation. He assumed a collection of individuals 
distributed in an amorphous social space. A message 
of unspecified content is introduced and then communicated 
by one individual to another in a totally random fashion. 

No structure is assumed whatsoever for this social 
group, no obstacles and no networks of communication. 
Gradually, such conditions (obstacles etc. ) are introduced, 
which complicate the model and approximate reality 
closer. It should be stressed that Rapoport^s model is 
inductive in that observations of actual diffusion served 
to make the theoretical adjustments to his model. The 
following are some of the restrictions introduced to 
the free model outlined above. First, that an individual 
belongs to a group within which he makes all his contacts. 
(This condition will have to be changed in our case as 
what really interests us is the contact with individuals 
belonging to another group. ) Next, a hyix)thesis. is 
made that the number of group members is much 
higher than the contacts an individual can make. Other 
conditions are gradually introduced, such as the 
probability of an individual having been informed 
(contacted) by a certain time -- through another 
individual; the probability of the latter having been 
informed prior to the time of the contact, and so on. 

In our particular instance, the Macedorumanian nomads 



or any other instance of linguistic diffusion(^). 
Additional restraints would have to be introduced, 
reflecting structural resistance, "holes in the 
pattern", Sapir^s "drift" or any other purely 
linguistic forces at work (Klein, 1966:67, 68). 

In this particular case study, some way 
would have to be devised to represent the periodic 
contacts of the Macedorumanians or any other 
group with the group speaking the "other language", 
as well as the intensity of these contacts. This 
introduces a new dimension and social models 
exist which can be immediately applied. The 
new dimension is the distance to the different 
areas to which the Macedorumanians migrated 
periodically (see map of nomadic migrations 
in appendix). The social or geographic models 
to be considered can be utilised in two ways. 

First, for the migration itself, then for the 
probability of mutual influence among the 
languages in contact. The following is an 
illustration with the simplest such model available. 
The model was proposed by Zipf and is one of the 
least known among linguists, although geographers 
and sociologists are quite familiar with itv* ). 

Zipf proposed that the amount of interaction between 
two cities is proportional to P 2 , where Pi and 



P 2 represent the populations of the two cities and D 
represents the closest distance after consideration 



(6) Simple tests of the model can examine the spread 
of particular slang or other technical words within 
a group or across groups. 

(7) It seems that the hypothesis was formulated originally 
long before Zipf, in 1858 (Olsson, 1965:44). 



of all means of transportation. This model is not 
very powerful as it does not take into consideration 
any of the social, economic or administrative factors . 
However, it can be used as a starting point, in 
modeling the language distance factor for incorporation 
into the linguistic diffusion model. P 2 in this case 
could represent number of phonemes or number of 
words in a lexicon, or any such unit. Of course, the 
difficulties involved in this particular application 
because of the structural relationships in language 
are even more severe than those involved in the 
application of the model to the study of migration. 

A more successful model was proposed by Stouffer, 
(1940, 1960) in which the process is examined in ' 
terms of migration of opportunities available to 
the migrant, according to both geographic distance 
and his membership in a particular social class. 

This model is mathematically extremely simpleV®). 
Finally, a very ingenious model displays qualities 
which would render it applicable to linguistic 
studies. It was first proposed by Hftgerstrand in 
a simulation of diffusion of particular cultural 
items in an area in Sweden based on density of 
communication between different points in this 
geographic area. H^gerstrand divides his area 
into geographic cells for which density of population 
and communication was defined on the basis of 
frequency of telephone calls across different 
points and migratory movements of the population. 



(8) In general lines, Stouffer proposed that the number 
of people migrating a certain distance is proportional 
to; opportunities available at that distance 

opportunities available within a lesser distance. 



Thus probability of contact between the different 
cells could be calculated. 

In his later models H&gerstrand introduced 
additional considerations such as, unevenness in 
their population distribution, as well as a variety 
of influences and physical obstacles. The above 
models can be applied to a study of linguistic 
diffusion in cases like that of the Macedorumanians 
but also in any other case of multilingual setting in 
several ways, first of all purely for the study of 
linguistic diffusion as an innovation, second for 
the study of the populations which are involved and 
their migratory patterns, and third it can be 
applied in a rather metaphorical sense to the study 
of the vocabularies. In other words, we can apply the 
migration model but this time its units would take 
not human individuals but words. Here, instead of 
two ethnic groups with social sub-groups, obstacles, 
etc . , we have two word populations, (that is, the 
lexicon of the two languages in contact), with their 
existing sub-structures of words definable by center 
of interest or word class, and so on. (This again is 
analogous to social structure in which classes do 
enter in complex inter-relations. ) A loan then would 
be an "individual" migrating into the other population 
of words, entering another group and being integrated 
into it. This model can be incorporated into the 
sociodemographic model and tested with real data. 

As opportunities we can define the cultural distance 
between the two groups in contact. In the specific 
example of the Macedorumanians, we can study via 
the preceding sociolinguistic analysis, the domains, 
topics, and roles in which code switching onto Greek, 
or Slavic, or Albanian occurs. From these centers 
of interest so defined and lexical sub-structures that 
are related to them, we can thus derive an index of 
opportunity for loan-words into this particular center 
of interest. As data, we can use loan-words in two 
languages in contact as for instance, English and 



French in Canada, Macedorumanian and Greek in 
the Balkans, etc. Creole languages can also be 
analyzed in this particular way. We can perhaps 
derive a function over -time saturation of the 
lexicon of one language with words of another 
language, the two languages being in contact. 



9. Conclusions 



In this paper, different methods of analysis 
of contact situations and diffusion models have been 
reviewed. These can be studied further and combined 
into an overall model of language contact and language 
convergence resulting from bilingualism. The various 
dimensions such as, linguistic structure, socio- 
linguistic distribution, space and time can be incorp- 
orated into the study of ongoing processes, as well as 
prediction future and historical, that is, reconstruct- 
ion of past sociolinguistic situations. 



10. Additional notes 



1. Years of "contact" and possible points for 
word transfer in the two lexicons (corresponding to 
the quotation "opportunities" in the migration model) 
will be defined by a careful analysis of the language 
distribution by domain: this will pinpoint the topic - 
listener - speaker interaction and will enable us to 
assign the opportunity values. 

2. Models of multilingual commimication in 
under-developed and other communities, based on 
the above exposition can be used in language planning 
policy, as they can calculate or define the commun- 
icative efficiency on the social and national level of 
particular linguistic codes. 
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